Corpus: ukr-ua_web_2015_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 93 96 98 99 99
1000 791 950 975 988 996
10000 6821 9324 9762 9865 9891
100000 42463 83424 94881 97797 98491
1000000 42464 83425 94882 97798 98492


Zipf's diagram for sentence endings


Gnuplot diagram

19106 msec needed at 2018-06-29 12:21